Language identification of code switching Malay-English words using syllable structure information

نویسندگان

  • Yin-Lai Yeong
  • Tien Ping Tan
چکیده

This paper introduces a language identification approach using syllable structure information. We also review and compare other approaches. Most of these approaches use linguistic information for language identification. The information used for language identification is Malay affixation information, English vocabulary list, alphabet ngram, grapheme n-gram. The approach using syllable structure information has the highest accuracy at 93.73% compared to other approaches. Based on the accuracy result of comparison, by using syllable structure 1.91% accuracy had increased for language identification compare with the second higher result in this paper. Syllable structure information is able to gain a better result for language identification.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Syllabification Algorithm based on Syllable Rules Matching for Malay Language

In this paper, we present a new syllabification algorithm for Malay language. Syllabification is the process to extract or divide syllable from words. Syllabification process is language dependent where each language can have its own set of syllable structure. Syllabication is an important component in speech synthesizer, speech recognition and transliteration system. Syllabification algorithms...

متن کامل

Language identification of code Switching sentences and multilingual sentences of under-resourced languages by using multi structural word information

Language identification (LID) is a process to identify the languages used in a text or speech. Code switching is the switching of a language in a sentence or speech utterance. This paper focuses on LID of words in code switching sentences. Code switching can occur intersentential or intrasentential. The reasons why a writer switches from one language to another due to various reasons and among ...

متن کامل

The effect of Code switching on the Acquisition of Object Relative Clauses by Iranian EFL Learners

This study attempted to investigate the impact of teacher’s code-switching on the acquisition of a problematic grammatical structure, namely, object relative clauses, by intermediate EFL learners. Moreover, a secondary objective of the study was to determine the EFL learners’ attitudes and opinions regarding the effectiveness of teacher’s code-switching in their learning of a specific aspect of...

متن کامل

A Comparative Analysis of Word Structures in Malay and English Children’s Stories

Malay is described as an alphabetic language with salient syllabic structures. In our attempt to develop a reading intervention program for early Malay struggling readers, word analysis of Malay children’s stories was conducted. Additionally, in order to have a better understanding of Malay word structures, a cross-linguistic comparison with English was attempted. The results indicate significa...

متن کامل

Language Identification by Using Syllable-Based Duration Classification on Code-Switching Speech

Many approaches to automatic spoken language identification (LID) on monolingual speech are successfully, but LID on the code-switching speech identifying at least 2 languages from one acoustic utterance challenges these approaches. In [6], we have successfully used one-pass approach to recognize the Chinese character on the Mandarin-Taiwanese code-switching speech. In this paper, we introduce ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010